Before any processing of the textual content of a document image can be performed the text must be separated from the background of the image. Several thresholding algorithms have previously been proposed and are widely used in document processing. None have been shown effective at thresholding difficult documents where the background and foreground are non-uniform. In this paper we investigate the use of three global thresholding algorithms (Otsu’s, Kapur’s entropy and Solihin’s quadratic integral ratio (QIR)) as the first stage in a multi-stage thresholding algorithm for use in degraded document images. It is concluded that Otsu’s and Kapur’s algorithms do not work well for difficult documents as they tend to over-threshold the image, thu...
This paper presents a new technique for the binarization of historical document images characterized...
Document images are usually degraded in the course of photocopying, faxing, printing, or scanning. D...
Abstract: Rendering document images for scanning and printing applications typically involves binari...
A number of techniques have previously been proposed for effective thresholding of document images. ...
Binarization methods play a central role in document image processing. It is usually performed in th...
Documents are degraded normally by the disturbance caused in the background. These are known as blee...
A multi-stage approach is presented for thresholding document images, along with its application. Th...
Documents are archived and preserved in large quantities worldwide. Electronic scanning is a common ...
savakis @ kodak.com Two algorithms for document image thresholding are presented, that are suitable ...
Image binarization is the separation of each pixel values into two collections, black as a foregroun...
Abstract- Segmentation of text from poorly documented images is a very difficult task due to high mu...
Binarization of document images is one of the most relevant pre-processing operations, leading to a ...
This paper deals with an adaptive approach for binarization and enhancement of degraded documents. c...
An effective approach for extracting document images from a noisy background is introduced. The enti...
The old documents in Jawi script are being used widely for references. The hard copies of those scr...
This paper presents a new technique for the binarization of historical document images characterized...
Document images are usually degraded in the course of photocopying, faxing, printing, or scanning. D...
Abstract: Rendering document images for scanning and printing applications typically involves binari...
A number of techniques have previously been proposed for effective thresholding of document images. ...
Binarization methods play a central role in document image processing. It is usually performed in th...
Documents are degraded normally by the disturbance caused in the background. These are known as blee...
A multi-stage approach is presented for thresholding document images, along with its application. Th...
Documents are archived and preserved in large quantities worldwide. Electronic scanning is a common ...
savakis @ kodak.com Two algorithms for document image thresholding are presented, that are suitable ...
Image binarization is the separation of each pixel values into two collections, black as a foregroun...
Abstract- Segmentation of text from poorly documented images is a very difficult task due to high mu...
Binarization of document images is one of the most relevant pre-processing operations, leading to a ...
This paper deals with an adaptive approach for binarization and enhancement of degraded documents. c...
An effective approach for extracting document images from a noisy background is introduced. The enti...
The old documents in Jawi script are being used widely for references. The hard copies of those scr...
This paper presents a new technique for the binarization of historical document images characterized...
Document images are usually degraded in the course of photocopying, faxing, printing, or scanning. D...
Abstract: Rendering document images for scanning and printing applications typically involves binari...